Corpus-Centered Computation
نویسنده
چکیده
To achieve translation technology that is adequate for speech-to-speech translation (S2S), this paper introduces a new attempt named Corpus-Centered Computation, (abbreviated to C and pronounced c-cube). As opposed to conventional approaches adopted by machine translation systems for written language, C places corpora at the center of the technology. For example, translation knowledge is extracted from corpora, translation quality is gauged by referring to corpora and the corpora themselves are normalized by paraphrasing or filtering. High-quality translation has been demonstrated in the domain of travel conversation, and the prospects of this approach are promising due to the benefits of synergistic effects.
منابع مشابه
A Multidisciplinary Student-centered Laboratory
1 Texas A&M University -Corpus Christi Department of Computing and Mathematical Sciences, [email protected], [email protected], [email protected], [email protected] Abstract In this paper we describe a student-centered laboratory developed by the Department of Computing and Mathematical Sciences at Texas A&M University – Corpus Christi and partially supported...
متن کاملA corpus-centered approach to spoken language translation
This paper reports the latest performance of components and features of a project named CorpusCentered Computation (C'3), which targets a translation technology suitable for spoken language translation. C3 places corpora at the center of the technology. Translation knowledge is extracted from corpora by both EBMT and SMT methods, translation quality is gauged by referring to corpora, the best t...
متن کاملEBMT, SMT, hybrid and more: ATR spoken language translation system
This paper introduces ATR’s project named Corpus-Centered Computation (C3), which aims at developing a translation technology suitable for spoken language translation. C3 places corpora at the center of its technology. Translation knowledge is extracted from corpora, translation quality is gauged by referring to corpora, the best translation among multiple-engine outputs is selected based on co...
متن کاملHuman-Centered Analysis and Visualization Tools for the Blogosphere
Blogging has become a new and disruptive communication medium. Blogs have changed the way people and organizations express, interact, and—quite unforeseen—exercise influence. The digital nature of the blog media provides access to an always-expanding corpus of information. It would take more than a lifetime to read all the available blogs necessary to answer questions such as what were the more...
متن کاملModeling Narrative-Centered Tutorial Decision Making in Guided Discovery Learning
Interactive narrative-centered learning environments offer significant potential for scaffolding guided discovery learning in rich virtual storyworlds while creating engaging and pedagogically effective experiences. Within these environments students actively participate in problem-solving activities. A significant challenge posed by narrative-centered learning environments is devising accurate...
متن کامل